Analysis of Bug Triage using Data Preprocessing (Reduction) Techniques

نویسندگان

  • A. Paradkar
  • S. Demeyer
  • E. Giger
  • B. Goethals
  • X. Wang
  • L. Zhang
  • T. Xie
  • J. Anvik
  • Jifeng Xuan
  • A. Alonso-Betanzos
چکیده

In the bug triage we have an unavoidable step of fixing the bugs which helps in correctly assigning a developer to a new bug. Text classification and binary classification techniques are applied to decrease the time cost in manual work and to enhance the working of automatic bug triage. We address the problem of data reduction and hence we combine the instance selection and the feature selection algorithms to simultaneously reduce the data scale and enhance the accuracy of the bug reports in the bug triage. We determine a predictive model to perform the algorithms which adds on to prioritize the developer to a new bug by extracting attributes and the bug data set from the historical table. By leveraging data mining techniques, mining software repositories can uncover interesting information in software repositories and solve real-world software problem like Eclipse, Mozilla and GNOME.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Bug Triage using Data Preprocessing (Reduction) Techniques

In the bug triage we have an unavoidable step of fixing the bugs which helps in correctly assigning a developer to a new bug. Text classification and binary classification techniques are applied to decrease the time cost in manual work and to enhance the working of automatic bug triage. We address the problem of data reduction and hence we combine the instance selection and the feature selectio...

متن کامل

Improved Approach for Predicting the Bug Triage Using Data Reduction Methods

Most of the software companies need to deal with vast number of software bugs day to day. This paper can be viewed as an application of instance selection and feature selection in bug repositories. The aim is to address the problem of data reduction for bug triage, and to reduce the scale and improve the quality of bug data. This can be achieved by combining instance selection with feature sele...

متن کامل

Automatic Bug Triaging System using Prediction Algorithm on Rating Basis

Programming associations spend over 45% of cost in overseeing programming bugs. An unavoidable walk of modifying bugs is bug triage, which arrangements to precisely apportion a specialist to another bug. To reduce the time cost in manual work, content request techniques are associated with lead customized bug triage. In this paper, we address the issue of data extenuation for bug triage, i.e., ...

متن کامل

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

متن کامل

Behavioral Analysis of Traffic Flow for an Effective Network Traffic Identification

Fast and accurate network traffic identification is becoming essential for network management, high quality of service control and early detection of network traffic abnormalities. Techniques based on statistical features of packet flows have recently become popular for network classification due to the limitations of traditional port and payload based methods. In this paper, we propose a metho...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016